Query-Based Keyphrase Extraction from Long Documents
نویسندگان
چکیده
Transformer-based architectures in natural language processing force input size limits that can be problematic when long documents need to processed. This paper overcomes this issue for keyphrase extraction by chunking the while keeping a global context as query defining topic which relevant keyphrases should extracted. The developed system employs pre-trained BERT model and adapts it estimate probability given text span forms keyphrase. We experimented using various sizes on two popular datasets, Inspec SemEval, large novel dataset. presented results show shorter with longer one without documents.
منابع مشابه
Query-Oriented Keyphrase Extraction
People often issue informational queries to search engines to find out more about some entities or events. While a Wikipedia-like summary would be an ideal answer to such queries, not all queries have a corresponding Wikipedia entry. In this work we propose to study query-oriented keyphrase extraction, which can be used to assist search results summarization. We propose a general method for key...
متن کاملPositionRank: An Unsupervised Approach to Keyphrase Extraction from Scholarly Documents
The large and growing amounts of online scholarly data present both challenges and opportunities to enhance knowledge discovery. One such challenge is to automatically extract a small set of keyphrases from a document that can accurately describe the document’s content and can facilitate fast information processing. In this paper, we propose PositionRank, an unsupervised model for keyphrase ext...
متن کاملKeyphrase extraction through query performance prediction
Previous research shows that keyphrases are useful tools in document retrieval and navigation. While these point to a relation between keyphrases and document retrieval performance, no other work uses this relationship to identify keyphrases of a given document. This work aims to establish a link between the problems of Query Performance Prediction (QPP) and keyphrase extraction. To this end, f...
متن کاملA Distributed Framework for NLP-Based Keyword and Keyphrase Extraction From Web Pages and Documents
The recent growth of the World Wide Web at increasing rate and speed and the number of online available resources populating Internet represent a massive source of knowledge for various research and business interests. Such knowledge is, for the most part, embedded in the textual content of web pages and documents, which is largely represented as unstructured natural language formats. In order ...
متن کاملTopical Keyphrase Extraction from Twitter
Summarizing and analyzing Twitter content is an important and challenging task. In this paper, we propose to extract topical keyphrases as one way to summarize Twitter. We propose a context-sensitive topical PageRank method for keyword ranking and a probabilistic scoring function that considers both relevance and interestingness of keyphrases for keyphrase ranking. We evaluate our proposed meth...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the ... International Florida Artificial Intelligence Research Society Conference
سال: 2022
ISSN: ['2334-0762', '2334-0754']
DOI: https://doi.org/10.32473/flairs.v35i.130737